1 

Information Cascades with Noise 

Tho Ngoc Le, Vijay G. Subramanian, and Randall A. Berry 


Abstract 

Online networks enable agents to better observe the behavior of others and in doing so potentially 
learn from their actions. A key feature of models for such social learning is that information cascades 
can result, in which agents ignore their private information and blindly follow the actions of other agents. 

This paper considers the impact of noise in the form of observation errors in such a model. Similar to 
a model without noise, we show that with noise, both correct and incorrect cascades happen, with the 
same level of fragility. However, with noise, it is harder to overturn a cascade from one direction to 
the other. Using Markov chain analysis, we derive the net welfare (payoff) of each agent as a function 
of his private signal quality and the error rate. We show, somewhat surprisingly, that in certain cases, 
increasing the observation error rate can lead to higher welfare for all but a finite number of agents. In 
such cases, we compare and contrast adding additional noise with simply withholding observations from 
the first few agents. Our analysis assumes that all erroneous observations are available on a common 
database; however, we also discuss relaxing this assumption. We conclude by discussing the impact of 
different tie-breaking rules on the probability of wrong cascade, and the impact of bounded rationality 
on the agents’ welfare. 

Index Terms 

Perfect Bayesian Equilibrium, Bayesian Learning, Information Cascades, Herding. 

I. Introduction 

Consider a recommendation system where agents sequentially decide whether to buy an item, 
for which they have some prior knowledge of its quality/utility. Agents’ decisions are reported to 
a common database that is available to all later agents (e.g., via a website), who can potentially 
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benefit from the information obtained by observing their predecessors’ choices. The study of 
such a model was initiated in the seminal papers [9], [10], and [11], which cast this in an 
observational Bayesian learning framework. In these models, each individual (agent) has some 
prior knowledge or signal about some payoff-relevant state of the world generated according 
to a commonly known probability distribution. Each agent makes a one-time decision in an 
exogenous order and his action is recorded on a common database available to the public. From 
this database, each agent then observes exactly the decisions made by all previous agents. Given 
these observations and their own signals, agents are assumed to be Bayesian rational, i.e., they 
sequentially choose the action that reflects their posterior beliefs about the state of the world. To 
put this in game-theoretic jargon, the setting is equivalent to a dynamic game with asymmetric 
information. The equilibrium analyzed in these problems is a Perfect Bayesian Equilibrium 
(PBE), which uses the common information based belief to determine the strategies 1 . 

We highlight two possible outcomes for such a model. First, there is learning. This is defined 
as when the information in private signals is aggregated efficiently, and agents eventually know 
the underlying true value of the item and make socially optimal decisions. Second, there is 
cascading. An information cascade occurs when it is optimal for the agents to ignore their own 
signals when taking actions. Though individually optimal, this may lead to the agents making 
a choice that is socially sub-optimal, which we refer to as a “wrong cascade.” In addition to 
the possibility of cascading to the wrong conclusion, an information cascade results in a loss of 
information about the private signals held by all the agents following the onset of cascade. 

In models where a homogeneous population of agents with discrete, bounded private signals 
is assumed (e.g. [9]-[ll]), a main result is that a cascade happens in a finite number of agents 
with probability one. This leads to a positive probability of cascading toward the wrong choice. 
Once a cascade occurs, all subsequent agents follow suit, given the underlying homogeneity. 
The ultimate consequence is that all private signals from the onset of cascading are lost; thus 
learning stops. 

In this paper, we consider a similar Bayesian learning model as in [9] -[1 1] except we introduce 
obsen’ation errors. More precisely, we assume observation errors occur when each agent’s action 
is recorded for all subsequent agents to see, with the recording subject to error, with the statistics 


'See [25] where a formal proof is presented. 
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of the error process known to all agents. The motivation of having observation errors in our model 
comes from the following reasons. First, real-world agents can make mistakes. As an example, 
this could model a setting where agents are asked to report their decisions on a website and agents 
occasionally misreport (e.g., by simply pressing the wrong button). In communication networks 
settings, observation errors can also occur as a form of broadcast failure in feedforward networks 
([29]). Indeed, even highly sophisticated decision makers such as investors ([28]) and consumers 
([27]) can make mistakes in choosing (binary) actions (investing in stocks or purchasing goods). 
Second, observation errors could come from the mistakes incurred by learning algorithms based 
on web-log data. For example, consider scenarios when a website uses the customers’ web-log 
records to infer the customers’ purchasing decisions and then posts such decisions on a database; 
the errors can occur from both the data and the algorithms used for such inference (e.g., a user 
could buy an item from a different retailer or could buy it and then return it elsewhere). Finally, 
this observation error could also result from either strategic agents ([32]), or a social planner 
([31]), who can manipulate the recording for their own benefit. 2 Even in scenarios when there is 
no clear incentive for benefit, agents can choose not to express their true opinions, e.g., IMDB 
review manipulations ([37]). We study the effects of these observation errors on the probabilities 
of cascades and the welfare obtained for any given agent, and also asymptotically as the number 
of agents increases without bound. 

A. Contributions 

Our work has several contributions. First, we develop a simple Markov chain model that we 
use to analyze the probabilities of cascades and the expected pay-off of each agent as a function 
of the error-rates and signal quality. We then study the effect of changing the noise level on both 
the asymptotic behavior and the behavior for an arbitrary agent by using stochastic ordering and 
coupling methods for Markov chains. Our results demonstrate a counter-intuitive phenomenon: 
for certain parameter settings, the asymptotic average payoff can be increased by increasing the 
noise level. The extent of this phenomenon and the amount of noise to be added depend on the 
agents’ signal quality and the total amount of noise already present. 

2 In the case of strategic agents manipulating reviews, our notion of welfare would need to be modified to account for the 
preferences of these agents. 
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Second, we extend our conclusions by considering which agents benefit from adding noise 
in the low noise regime. Interestingly, we show that if one agent benefits, the following agent 
may not. However, we show several properties of the sequence of benefiting agents, and further 
show that if one agent benefits at most a finite number of subsequent agents will not benefit. 

Third, we compare and contrast the trade-offs of two approaches for improving the agents’ 
welfares: 1) Adding observation noise (whenever it is beneficial to do so); and 2) Forcing the 
first few agents to be “guinea pigs," i.e., to follow their private signals as in [19]. We see that 
the second method leads to higher asymptotic pay-offs, but at the cost of lower pay-offs for an 
initial group of users. We also suggest a hybrid method that for certain model parameters can 
significantly reduce the number of required guinea pigs, while still achieving better asymptotic 
welfare than either approach separately. 

Fourth, we also discuss the role of a common database in our analysis. We argue that such 
a database is crucial in enabling the agents being able to to infer the exact time in the history 
where a cascade happens. In particular, we compare the common database setting to one where 
each agent’s observation is an i.i.d. erroneous version of the sequence of the true actions taken 
before. In such case, we show that a cascade may not persist once it starts. Moreover, this is 
shown to greatly increase the complexity of updating beliefs for each agent, making the Bayesian 
assumption more questionable. 

Finally, our work also highlights the advantage of Markov Chain methods in obtaining a refined 
and detail analysis of the problem, in particular, obtaining per agent payoffs and determining 
sensitivity to parameter changes. This should be contrasted with the most of the literature, which 
uses martingale methods to discuss asymptotic performance measures. 

The remainder of the paper is organized as follows. We start by discussing related work in 
Section I-B. In Section II, we state our model. In Section III, we analyze the Bayesian updates 
for this model and give our Markov chain formulation. We then turn to studying the impact of 
changing the noise level in Section IV. In Section V, our analysis of when adding noise is better 
is presented, for both cases of asymptotic and individual welfares. We compare adding noise 
to the “guinea pigs” model in Section VI, and then consider the case of no common database 
in Section VII. In Section VIII, we present some discussions of other changes to the model 
including different tie -breaking rules, and the effect of agents’ bounded rationality. We conclude 
in Section IX. 
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B. Related work 

In models where information is aggregated independently and efficiently, e.g. [22], learning 
can be achieved. However, the results in [9]-[ll] showed that pathological outcomes, such as 
cascading/herding can occur in scenarios where decisions are taken sequentially and agents can 
observe previous actions. In the following, we organize the related literature in an attempt to 
answer the questions: why does a cascade happen? or when can learning occur? 

First , learning fails because the discrete feedback, i.e. previous actions, from the agents is not 
sufficiently rich. This is connected to early models of sequential detection/hypothesis testing, 
e.g., [2] and [3]. In these works, a hypothesis is tested at each stage based on a past feedback 
and a new observation. A new feedback is then encoded and passed on to the next stage. In 
[3], the authors showed that if a discrete feedback is used with a regular encoding-decoding 
procedure, then the probability of choosing the wrong hypothesis is bounded away from zero. In 
terms of cascading jargon as in [9]-[ll], this means the probability of wrong cascade is positive. 
Instead, by changing the encoding-decoding algorithm, the authors in [2] showed that learning 
is achieved using a feedback encoded in only two bits of information. 

Second , as showed by Smith and Sorensen in [17], the boundedness of the likelihood ratios of 
the private signals is another reason why learning with the regular encoding-decoding procedure 
fails. Using both martingale techniques and Markovian analysis, Smith and Sorensen showed 
that with unbounded likelihood ratio, any deep cascade can be stopped and learning toward the 
correct action can be achieved. Otherwise, there is a positive probability that learning fails and all 
but a first few agents cascade to the less profitable action. In addition, these authors considered 
a richer assumption where there are two types of agents with the types unknown. Assuming that 
the distribution of the types is common knowledge, this leads to another type of outcome known 
as “confounded learning”, where from some point onward the history offers no information and 
subsequent agents must rely on their private signals. 

Third , another reason is that Bayesian update plus constant threshold rule may not be optimal 
to induce learning. In [2], the author proposed a time-variant encoding rule for finite memory 
approximation that make learning possible. Similar conclusions also hold for more general 
networks (e.g., [29]), or for a random decision rule (e.g., [35]). In these works, it is not clear 
whether it is in the agents’ best interest to strictly adhere to such set of rules; therefore the 
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outcomes might not constitute an equilibrium. Here, we instead consider a strategic setting 
where each agent acts in his own benefit, and the agents are in a PBE. In particular, we limit 
ourselves to binary feedback in the form of the agent’s actions and assume that each agent 
deterministically chooses his action to optimize his own pay-off (i.e., there is no planner). 

Finally, the richest venue for inducing learning, as being explored by the literature, is by 
changing the information structure from the simple models in [9]-[ll]. In the rest of this section, 
we discuss works that focus on this approach. In [25], the authors also considered Bayesian, 
observational learning with bounded and unbounded signal quality (similar to [17]). However, 
each agent only gets to see the actions of a random subset (with some underlying distribution) 
of the agents that acted before. In particular, the authors provided the conditions on the network 
formed by sampling past actions that would support learning. If private signals are unbounded, 
learning is achieved if such network satisfies “expanding observation” 3 . Moreover, even when 
the private signals are bounded, if the network allows information to be collected and passed to 
later agents, learning is also achieved. 

Another work on observational learning (with both Bayesian and non-Bayesian settings) when 
the signals are bounded, and the network is also sampled, was presented in [30]. However, the 
sampling scheme is deterministic as the authors assumed that each agent observes only a fixed 
K of his immediate predecessors. In the Bayesian setting where the agents myopically maximize 
their expected payoffs, learning almost surely fails for all K while it is achieved in probability 
only for K > 2 by using a sophisticated encoding-decoding procedure, as in the spirit of [2]. On 
the other hand, for the non-Bayesian setting where agents are forward looking and maximize 
the discounted sum (over all agents) of the probabilities of a correct decision, learning almost 
surely fails for all values of K. 

Also in the line of sampling of the observation history, in [36] the author instead studied a 
social learning model where agents move in endogenous order. In this setting, the agents have 
to pay a cost to, strategically, choose their observation networks. Under these assumptions, the 
model showed that unbounded private belief is not necessary for asymptotic learning as long as 


3 According to these authors, an expanding network means agents avoid observing exclusively any subset of other agent, thus 
allow the arrival of new information from other sources in the network. 
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agent are not limited to seeing only a finite number of observations 4 . Similar to [25], the author 
characterized the pure-strategy PBE for which learning occurs. 

In another strand of work where the information structure is modified, an attempt to improve 
learning that aims to achieve maximum social-welfare is by sacrificing a number of agents as 
per the sequential experiment policy for the multi-armed bandit problem suggested by [7], [13]. 
Based on such experiments, we can more accurately solving the hypothesis test problem and 
use the result for remaining agents. An example in this line of work was suggested by Sgroi in 
[19], where the sacrificial victims are called the “guinea pigs”. 

In Sgroi’s paper [19], the author considered a similar model to [9], assuming a population with 
finite number of agents. Suppose that there is a social planner who can force the first few agents, 
the “guinea pigs,” to make decisions based only on their private signals. As a result, subsequent 
agents, who benefited from collecting the additional private signals from those guinea pigs, could 
make a decision that leads to higher expected welfare. Thus, as more guinea pigs are sacrificed, 
more useful private signals are collected, and a closer to the socially optimal decision is achieved 
in the long run. The author also found the optimal number of guinea pigs that would maximize 
the total social welfare in a finite population. From a mechanism design viewpoint, however, the 
model in [19] omitted a discussion on the selection criteria of the potential guinea pigs and their 
corresponding fairness measures. Moreover, in a population with n agents, the optimal number 
of guinea pigs needed is OiXogn) which increases without bound ([13]). 

Another modification to the information structure includes learning with continuous action 
spaces. Given that every agent is endowed an informative private signal about the state of the 
world, learning could be achieved if these signals are appropriately collected. Pathologically, 
the occurrence of herding shows how such information is poorly aggregated ([15], [17], [12]). 
In [12], the author showed that herding can be prevented if there exists an action set that is a 
one-to-one mapping with the posterior of each agent. In other words, by allowing a continuum 
action space, each agent can fine-tune his action to the posterior. However, in practice, the 
assumption that every action can be chosen from a uniformly fine-grained set is rather strict 
for most decision making scenarios. Our paper and most of other works on the topic, instead, 
assume that the action space is discrete. 

4 This satisfies the “expanding network” condition as defined in [25] 
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In this paper, we assume a bounded signal quality, where types of agents are homogeneous, and 
a simple sampling network with all past actions being observable. However, our paper considers 
a different approach toward modifying the information structure. In addition to the asymptotic 
results, we also study finer per agent characterization and the variational characterization (as 
parameters change) that the Markovian analysis and coupling techniques afford. In particular, 
we relax the assumption of perfect observability of the action history and consider a similar 
Bayesian learning model as in [9] -[11] except with the introduction of noise in the form of 
obsen’ation errors. We study the effects of such errors on the welfare obtained asymptotically 
as the number of agents increases without bound. More precisely, we assume observation errors 
occur when each agent’s actions are recorded on a common database, but this record is subject 
to errors, with the statistics of the errors process known to all agents. 

II. Model 

We consider a model similar to [9] in which there is a countable population of agents, indexed 
n = 1,2,... with the index reflecting both the time and the order in which agents act. Each 
agent n chooses an action A„ of either buying (F) or not buying ( N ) a new item. The true value 
(V) of the item can be either good (G) or bad (B); for simplicity, both possibilities are assumed 
to be equally likely. 5 

The agents are Bayes-rational utility maximizers 6 whose payoff structure is based on the 
agent’s action and the true value V. If an agent chooses N, his payoff is 0. On the other hand, if 
he chooses F, he faces a cost of C = 1 /2 and gains one of two amounts depending on the true 
value of the item: his gain is 0 is V - B and 1 if V = G. The total pay-off of an agent choosing 
F is then the gain minus the cost. Thus, the ex ante expected payoff of each agent is 0. 



Figure 1: The BSC over which agents receive signals. 


5 A discussion of how to generalize this is provided in Section VIII-C . 

6 This assumption leads to a PBE that uses the common information based belief to determine strategies as proved in [25]. 
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To reflect the agents’ prior knowledge about V, each agent n receives a private signal S„ e 
{1 (high),0 (low)} through a binary symmetric channel (BSC) with crossover probability 1 — p, 
where 0.5 < p < 1. (See Fig. 1.) In other words, we have 

P (S n = 1|V = G) = P(S„ = 0|V - B) = p, and 
P (S n = 0\V = G) = P (S n = 1\V = B) = 1 - p. (1) 

Thus, the private signals are informative, but not revealing. We modify the information struc- 
ture in [9] by assuming that later agents’ observations are noisy versions of their predecessors’ 
actions. Specifically, we assume that each agent reports his action to a public database which 
is available to all successors. The errors in this process are modeled by passing every action A, 
through another BSC with crossover probability e e (0, 1/2). This means with probability 1-6, 
the reported action of agent i, G„ satisfies G, = A„ and with probability e, G, = A,, where A, 
is the opposite action of A,. This assumption reduces the dependence of every agent’s decision 
on the predecessors’ choices and drives him toward using his own signal. With this in place 
we assume that each agent n takes a one-time action A„ based on his private signal S „ and the 
(noisy) observations Oy , . . . , G„_i of all previous agents’ actions A { , . . . ,A„_i. 

III. Bayesian updates, cascades and error thresholds 

A. Bayesian updates 

The optimal action for the first agent is always to follow his private signal since no observation 
history is available. Starting from the second agent, every agent n considers his private signal 
S n and the observations Oy , ..., G„_i. Let the information set of agent n be \S n /H„ |} where 
7 H n -\ = {Gi, ..., G„_i} is the observation history. Based on {S agent n will update his 
posterior probability (using Bayes formula). 

Definition 1. The posterior probability of agent n, y n , is defined as y n (.S n , 7F„_i) = P[V = 
G\S n ,9d n -yl 
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Assuming common rationality 7 , the decision rule for agent n is as follows: 8 

Y, if y n > 1/2, 

A n = <N , if y n < 1/2, (2) 

follows S n , if y n = 1/2. 

Following [17], we can also state this decision rule in terms of a private and public likelihood 


ratio defined next. 


Definition 2. The private likelihood ratio of agent n, f3 m is defined as fi n (S „) = P[S„| V = 
B]/nS n \V = Gl 

Note that the private likelihood ratio is a function of the private signal, and we have /3„(0) = 
p/( 1 - p),/3„( 1) = (1 - p)/p- For a given p e (0.5, 1), these private ratios are strictly bounded in 
(l,oo) and (0, 1), respectively. 

Definition 3. The public likelihood ratio after agent n decides, £ n , is defined as £ n fH n ) = 
PpHJV - B\/T['H„\V = G]. This is available to all subsequent agents n + l,n + 2, . . . 

A simple application of Bayes formula gives y„ = -f - — . Therefore, using the above formulae 
for ff, and £ n _i, the rule in (2) can be applied to the resulting y„, which amounts to comparing 
to 1, e.g., with Y being the choice whenever [3 n ( n -\ < 1- In this decision-making process, 
cascading is defined as follows: 

Definition 4. An information cascade is said to happen when an agent chooses some fixed action 
regardless of his private signal. 

In particular, agent n cascades to the action Y(N) if y„ > 1/2 (< 1/2) for all realizations of 
S n e {0, 1}. In other words, agent n cascades to Y if /?„( 0)4-1 < 1 (i-e., £ n -\ < (1 ~ p)/p)), and 
cascades to N if /?„(l)4-i > 1 (or 4-i > p/( 1 - P))- It was shown in [9] that when agents are 
rational and observations are perfect (i.e., O n = A n , V/7 = 1, 2, . . .), an information cascade occurs 
in finite time with probability one. In our paper, we will show that this still holds. Another related 


7 As in the PBE in dynamic games. 

s When equality holds, our decision rule differs from [9], where it is assumed that indifferent agents randomly choose one 
action. Our assumption not only simplifies the analysis but also proves to be the best tie-breaking rule (see Section VIII-A). 
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phenomenon, herding , happens when agents choose to copy the actions of others irrespective 
of the realizations of their signals. In a model with a homogeneous population and discrete, 
bounded private signals (e.g. [9]-[ 1 1] and our paper), the two concepts cascading and herding 
are equivalent. Thus, from now on we will use the two interchangeably. 

B. Cascading properties 

In this section, we outline some basic properties of cascading with observation errors. These 
naturally extend properties for the noiseless case shown in [9], [19], so we omit detailed 
derivations. For completeness we provide complete proofs in Section VIII-C. 

Property 1. Until cascading occurs, agent n’s Bayesian updates depend only on his private 
signal S n , and the difference in the numbers ofY’s and N’s in the observation history r H n -\- 

In other words, given the common observation database 77„_i, this difference of T’s and 
N’s in 77 „_ 1 is a sufficient statistic for the observation history of agent n. This follows from 
the symmetry of the signal quality and the errors, which enables opposite observations to be 
“cancelled.” 9 

Note that once a cascade happens, any new observations do not provide new information about 
the true value V, thus the public likelihood ratio stays the same. On the other hand, if an agent 
n does not cascade, all agents 1,2, ...,n act independently based on their own private signals. 
Therefore, until a cascade happens, the public likelihood ratio l n depends only on the difference 
in the numbers of T’s and N’s in the observation history 77„ . Specifically, let h n be the number 
of T’s minus the number of N’s in 77,, (prior to a cascade), then l n - (^) ", where for all 
i = 1 , 2 , . . . , 77: 

a = P[G,- = T|V = G] = P[<9, = N\V = B ], (3) 

gives the probability that a given observation agrees with the optimal action for an agent that is 
not cascading. We can show that a = f(e,p ) e (0.5, p), where: 

f(x,y)±x(l-y) + (l-x)y. (4) 


9 This also extends to the case when V is not equally likely G or B as shown in Section VIII-C. 
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Property 2. Once a cascade happens, it lasts forever. 

Once started, a cascade makes agents stop using their private signals and and so they provide 
no information to the successors. The successors are left in the same situation with the optimal 
choice of the first one who started the herd. We stress the importance of having the common 
database available to all agents. With such information, every agent updates h n and knows the 
exact time when a cascade happens. All other subsequent observations from the onset of a cascade 
will be ignored. Thus, there is no need to further update h„. Next, we provide a condition for 
determining the onset of cascades given the signal quality p and observation error e. 

C. Error thresholds 

Without noise (e = 0), it was shown in [9] that cascading starts with the first agent n to have 
\h n -\\ = 2, i.e., the first agent to observe one action two times more than the other. However, as 
6 increases, each observation provides less information, which in turn can increase the value of 
\h n \ needed to start a cascade. This is characterized in the following lemma. 

Lemma 1. Let a = (1 -p)/p. For any k >2, define the increasing sequence of thresholds \ e k \f =2 , 
where the k th threshold e* k is given as: 

e k = ( 1 - a k l j / (1 - a k -' + - a) ■ (5) 

Define Ik- [ej , e£ +1 ) as the k th e-interval. Then for e e I k , any agent n starts a cascade as 
soon as \h n -i\ = k. 

The proof follows from direct calculation of y„; a detailed version is given in the Appendix. 
Note also that the parameter a here is the same as /?„( 1). Fig. 2 shows the thresholds e* k for 
different values of k and p. For k = 2, this lemma yields e, = 0, which is the case of noiseless 
observations as in [9]. 
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Figure 2: Thresholds for e 


A direct consequence of this lemma is that we can deduce k from a given e as: 

k = [ lo g(l-a)/a«J + !• ( 6 ) 

Here, recall that a is a function of e. Thus, given the signal quality p, the observation error 
e e I k and the observation history {0\, 0 2 , . . . , an agent n can track the difference /i„_i 

and declare a N cascade as soon as /?„_ i = -k, and a Y cascade as soon as h n -\ = k. In the 
limit of e — > 0.5, we have k — » oo, and thus agents only use their own signals and herding never 
starts. In this case, no information is passed through each observation. Therefore, no learning 
occurs either. 

Also note that e* k is a decreasing function of p. Therefore, the more accurate the private signal, 
the sooner cascades may start for a fixed e. In particular, Fig. 2 shows that if the private signal 
quality is high, with an intermediate level of noise a cascade may occur very early. 

In the models without error ([9]-[ll]), the authors also discussed how fragile a cascade can 
be. In fact, for the noiseless case, an information cascade occurs merely on the agreement of 
the running count of an action exceeding the other by two. Assuming that any of the subsequent 
agents is endowed with an additional private signal that disagrees with the cascading actions, 
the cascade stops. This provides some intuition to why many asset markets are volatile, or why 
changes in fashions happen frequently and unpredictably. With noisy observations, our results 
show that information cascades have the same level of fragility as in the noiseless case. In fact, 
for e e I k , it would require only one additional opposite private signal to reduce the difference 
to < k, so that the cascade stops. However, with noise, k can exceed 2 and it is harder to 
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reverse a cascade to the opposite direction: one would need a longer sequence of agents with 
private signals indicating the other action choice. Therefore, given a realistic setting where the 
information is noisy, it may take a long time to reverse certain trends. 

D. Markovian analysis of cascades 

By symmetry, first consider the case V = G. 10 By Property 1, a non-cascading agent n’s 
observation history can be summarized by h n . Thus, viewing each agent as a time-epoch, the 
agent’s observation history can be represented as a state of a discrete-time Markov chain (DTMC). 
Each state i represents values of h n _\ that agent n may see before making his decision. Note 
that the first agent starts at state 0 since no observation history is available. 

For the rest of the paper, assume e e I k so that an agent n will start a cascade if and only 
if \h n -\\ = k. Thus, to model cascades, we simply make the states ±k absorbing, and so are left 
with a finite state DTMC. The two events N cascade and Y cascade translate into hitting the 
left (-k) and the right (k) walls (or absorbing states), respectively. The probability of moving 
one step to the right is the probability that one more Y is added to the observation history, i.e., 
a = P [O n = y|V = G] = f(e,p ) > 0.5. Likewise, the probability of moving one-step to the left 
is 1 - a. Hence, this DTMC is a simple random walk as shown in Fig. 3. 


1 1-a 1-a 1-a 1-a 1-a 



Figure 3: Transition diagram of the random walk when V — G. 


The state transition matrix of this DTMC 

is 

given by 
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... 0 
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Q = 
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0 
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... 1-a 

0 
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0 

0 

0 

... 0 

0 

1 


(7) 


10 If V is not equally likely G or B, the dynamics of the Markov chains for this analysis are unchanged since these are 
conditioned on the true V. Moreover, the (unconditional) probabilities of cascades and expected welfare can be easily adjusted 
according to the prior knowledge of V. See details in Section VIII-C. 
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Since a > 0.5, Q is a row stochastic matrix with a drift to the right. We will use methods 
in [1] to calculate the probability of being at either absorbing state at any given time; since the 
agent indices correspond to the time index, this yields the probability of cascade for each agent. 
Assume that the process starts at state i. Let u* in ,v* n be the probabilities of being at the left 
wall, —k, and the right wall, k, at the n th step, respectively. Let y,-,, be the probabilities of 
hitting the left wall and the right wall for the first time at the n th step, respectively. Obviously, 
u *o n ~ v o n = 0 for 1 < n < k - 1. By [33], u 0m = v 0 ,„ = 0, for n-k odd, thus for n > k: 


*0 ,n 


= Z 
= Z 


n 

m—k 

n 


U -k,n-m U 0,m ~ 

, ^ k,n—m^(\ m ^ j 


m=k 


M(J / 


m=k 


Vo., 


( 8 ) 

(9) 


where the explicit expressions for the terms on the right-hand side in (8) and (9) are given in 
the next lemma. The proof can be found in Lemma 2 of [33]. 


Lemma 2. For n-k even: 

u 0 , n = (l/k)2 n a'^(l - a)^A Kn , (10) 

Vo , n = (l/k)Ta^(l-a)^A Kn , (11) 

where 

A k ,n = y/. , cos"^ 1 [jn/(2k)] sin [jn/(2k)\ (-1)^ . (12) 

By symmetry, if V - B, the wrong and correct cascade probabilities for agent n are v* Qn and 
u* 0n , respectively. The following lemma states some useful relations between the probabilities of 
cascade. 


Lemma 3. For e e I k , and any arbitrary agent n > k: 

1) If k is even, then u* 0n = if ) n+v v* ) n = v* 0n+1 for n even. 

2) If k is odd, then u* Qn = u* 0n+l ,v* 0n = v* 0n+l for n odd. 

3) The probabilities of correct and wrong cascade are related by: 

v* 0 ,Jul, = [a/(l-a)] k . (13) 

The proof of part 1) and 2) follows directly from (8) and (9), whereas the proof for part 3) 
follows by using the formulae from Lemma 2 in (8) and (9). 
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IV. Effect of varying observation error rates 

We next turn to studying the effect of the error rates on the cascade probabilities and agent 
welfare. 

A. Cascade probabilities 

Lemma 2 shows that the probabilities of wrong and correct cascade depend on k and a. From 
Lemma 1 and since a = /(e, p), it follows that for a fixed p, these probabilities are determined 
by the error probability e. 

The next theorem characterizes the effect of varying the error e on the probabilities of wrong 
cascade, u* 0n , and correct cascade, v* Qn , for an arbitrary agent n. 

Theorem 1. For e e 

1) The wrong cascade probability increases with e. 

2) The correct cascade probability decreases with e. 

Proof. Consider e < e" , both in I k , and let {Z'j n > 0 and {Z”}„> 0 be the two corresponding DTMCs 
on the same state space S = {-k,-k+ 1, ...,0, ...,k - 1 ,k). The following concept of stochastic 
ordering ([6]) compares these two DTMCs: 

Definition 5. Let X and Y be two discrete random variables taking values on the same set S 
and let x and y be their corresponding probability distribution vectors. X(x) is said to be larger 
than Y(y) in stochastic ordering, denoted by X > st Y(x > st y), if 

y\ > x ‘ - ^ f° r au j e ( i4 ) 

£—h>j t—h>] 

Definition 6. The DTMC {Z'j is said to be larger than the DTMC { Z”} in stochastic ordering, 
denoted by { Z'} > st {Z''}, if 

Z n > st Z", for all n > 0. (15) 

Definition 7. A transition matrix Q is said to be stochastically increasing if for all i, i - 1 6 S: 

Qi >st Qi - 1 ( 16 ) 

where Qi denotes the i th row of Q (which is a distribution vector for some random variable). 

The proof continues by noting that the corresponding right transition probabilities of the two 
DTMC’s satisfy a > a > 0.5. Thus, Qt >v Q'i for all i G S. Moreover, the transition matrices 
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for each DTMC are stochastically increasing, and both {Z'j and {Z"} start from the same state 0. 
Therefore, by Theorem 4.2.5a and equation (4.2.16) in [6], we have {Z'j > st {Z"}. By Defn. 6, 
it follows that at any arbitrary time n,Z' n > st Z I Let z and z be the corresponding probability 
distribution vectors at time n. 

Setting j = k in Defn. 5, we have v^ n - z k > z k = Vq” • Now, letting j = -k + 1 yields 
'Li>-k + 1 z'j > Tji>-k+i z- , so that u* 0n < u* 0 ' n . If fact, we can use a coupling argument to prove that 
equality does not hold for n > k, i.e., v* u is strictly greater than v*" for n > k. These details 
are shown in the Appendix E. Similarly, u*” n > u* Qn . Since Lemma 2 shows that u* 0n and v* 0n are 
continuous functions of e (in the specified interval), this completes the proof. □ 



Figure 4: Probability of wrong cascade for agent n, with p = 0.70. 

As a demonstration of Theorem 1, Fig. 4 shows the probability of wrong cascade when 
p = 0.70, for agents n = 5, 10 and 100. Notice that for any agent n, Lemma 1 shows that there 
exists e close enough to 0.5 that yields k > n. The probability of wrong cascade will thus go to 
zero for large enough e, as shown in the figure. Note also that for each agent n, as e increases, 
the probability of wrong cascade discontinuously decreases at a countable number of points (and 
continuously increases in between). These points correspond exactly to the values e* k for different 
choices of integer k > 2. 

B. Agent Welfare 

Let 7t n be the payoff or welfare of agent n. From Section II, we have n„ = 0 if A n = N, while 
if A n = Y, 7r„ is either 1/2 or -1/2 corresponding to V = G or V = B, respectively. All agents 
1 < n < k use their own signals; thus by averaging over V e {G, B\ they all have the same 
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welfare given by: 

E [7T„] = {P [An = Y\V = G\ - P [An = Y\V = B]} /4 
= (2p - l)/4 = F. (17) 

For agents n > k + 1 : 

E [n„\ = {P [A n = Y\V = G] - P [A n = Y\V = B ]} /4 
= F + [(1 - p)v* 0n _ x - pu 0n _ x ] /2. (18) 

For a fixed observation error e, (18) explicitly relates the welfare of an agent n, who faces the 
possibility of cascade, to the probability of wrong and correct cascades created by the immediate 
previous agent n — 1. Using results from Theorem 1, the following theorem summarizes some 
properties of the agents’ welfares. 


Theorem 2. With the same signal quality p and e e IV 

1) The expected welfare for each agent is at least equal to the expected welfare of his 
predecessors. Thus, E [n n \ > F and is non-decreasing in n. 

2) lim E [ 7r„ ] exists and equals: 

n—>oo 


ri(e) = F + 


1 


‘ + (W 


(19) 


3) For every agent n, E[n„] decreases continuously as e increases over Ik so that: 


limF [n n ] > E [n n ] > lim E = F 

e K + i 

Proof 1) When a cascade happens, every user takes the same action and so achieves the same 
expected welfare. Thus, we are left to show E [ tt„ ] > F for all n > k + 1. Using (18) and the 
form of vjU j, u* Qi _ v we only need to show: 

i fc i fc 

(1 - p)a~( 1 - a)~ - pa~{ 1 - a)~ > 0 , ( 20 ) 


which can be seen by noting the following: since et < e < et v we have: 


0 < 


1 — a 


< 


1 


< 


1 — a 


k+V 

k-4 


< 1 . 


( 21 ) 
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2) For (eR, let V 0 (t) and C/ 0 (0 be the probability generating functions for the first hitting time 
of state k and —k, respectively. Using these, the limiting welfare can be written as: 

n(e) - F = l - [(1 - p)V o(l) - pU 0 (l )\ . (22) 

Expressions for these generating functions are given in equations (47) and (48) in the Appendix 
D; evaluating these at t - 1 yields (19). 

3) For a fixed p, (17) shows that F decreases in e. The proof follows by using (18) and Theorem 
1. □ 

Theorem 2 suggests a few interesting observations. First, even though a model with no observa- 
tion errors yields maximum welfare for any agent, the welfare is not monotonically decreasing 
in e. In fact, as the given e approaches each e* from below, it is better to increase e to exactly 
e* k so that the welfare for each agent moves to the next local maximum. This comes from the 
fact that the probability of wrong cascade drops discontinuously as e crosses over e* k due to the 
structure of the underlying DTMC changing. Second, the first observation raises the question of 
when and how much observation error should be added, especially when some pre-set error is 
already present in this model. We will consider this question in the next section. 

V. When is more noise better? 

A. Asymptotic welfare 

Assume that there is a fixed observation error e in the model. Suppose a social planner is 
allowed to randomly change the history of observations with probability e s . This introduces an 
effect that is equivalent to increasing the observation error to 

Sotoi = <K1 - U) + (1 - e)e s = f(e, e s ). (23) 

Fig. 5 shows an example of the asymptotic welfare as a function of e s , when the pre-set 
observation error is e = 0.15 and p = 0.70. It is clear from this figure that the asymptotic 
welfare is maximized when e s ~ 0.12, i.e., more noise is beneficial. By part 1) of Theorem 2, 
this also maximizes the (Cesaro) average social welfare of the entire population. 
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p = 0.70, e = 0.15, e > 7Tfc ]00 



Figure 5: Asymptotic welfare as a function of additional noise. 


Whether more noise is beneficial depends on the existing noise 6 and on the signal quality p. 
To better understand this, for each k , define e I k such that adding some e s > 0 will improve 
the welfare of agent n if and only if e > Tk, n • Likewise, let T k oo denote this threshold for the 
asymptotic welfare. For the case in Fig. 5, since there exists ae s >0 that improves the welfare, 
it must be that e > r*, Note also that the optimal choice of e s corresponds to a discontinuity of 
the welfare, which from our previous analysis corresponds to making the effective noise, e tota i, 
equal to e* +1 . We formali z e this phenomenon in the following theorem. 


Theorem 3. Assume that e e I k : 

1) The asymptotic social welfare is maximized at either e s = 0 or e s = ( 'f 2 ‘ > 0. 

2) The latter case happens when c 2+] > e > r k ca > e* k , where 


Tk.oo 


1-2 p 


1 


I ,, £+1 

i + C—W 

v p ’ 


(24) 


Proof. 1) The proof of 1) follows by part 3 of Theorem 2. 

2) To prove 2), we need to find e s such that 11 ( 6 ) < U(e tota i). Using (19), we obtain the lower 
bound Tkoo- □ 
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Figure 6: Expected welfare for agent n with high signal quality. 



Figure 7: Expected welfare for agent n with low signal quality. 


Note that when it is optimal for the social planner to add e s > 0, the welfare of every individual 
agent is not necessarily improved. This effect is shown numerically in Fig. 6 and 7 with the 
same pre-set value e = 0.15, for high and low signal quality, respectively. These figures each 
shows the expected welfare for three agents corresponding to n = 5, 10 and 100 as a function 
of £ s . For high signal quality (p = 0.99), increasing e s is beneficial for all three agents, i.e., 
6 > max(T yt ,5,T£ j io>'Euoo)- On the other hand, for low signal quality (p = 0.51), increasing e s 
will increase the welfare of agent n = 100 and decrease the welfare of agents n = 5, 10, i.e., 
e > Eu oo, but 6 < minf/r^, 

B. Individual welfares in the low-noise region 

In Fig. 6 and 7, we observe that not all agents’ welfares are improved by adding noise. In this 
section, we take a deeper look and consider the low-noise region, defined as when the pre-set 
observation error e e I 2 . In this region, it takes only two (consecutive) agents to start a cascade 
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as in the noise-free model. We derive a few important results for the sequence {t 2 ) „}^L 1 , where n 
is the agent index. For notational simplicity, we suppress k = 2 and abbreviate t 2 ,„ as r„. Using 
part 3) of Lemma 3, (18) is rewritten as: 


E [j T n \ = F + ^ 



(25) 


Now using part 1) and 2) of Lemma 3, if n > k then E\n n \ = E\n„-\ ] is true for k and n, both 
odd or both even. Thus, we can simplify the welfare in (25) for the low noise region, as stated 
in the next lemma. 


Lemma 4. 

1 ) For 6 6 1 2 and m > 1 : 


E [77 2 m+2] — E [ 7T 2m+ 1 J 


(26) 


= F + \[( l ~ pW ~ P(\ ~ a) 2 \ 


1 - [2a{\ - a)]' n 
1 - 2a(\ - a) 


2) For 6 6 It, and m > 2: 


E [^2m + 1 1 E [2T2 m ] 


(27) 


(28) 



[(1 -p)a 3 -p( 1 - a) 3 ] 


1 - [3a(l - a)]”- 1 
1 - 3a(l - a) 


(29) 


Proof See details in the archived version in [38]. 


□ 


We remind the reader that by Theorem 2, the expected welfare E[n„\ is non-decreasing in n. 
Lemma 4 suggests that the welfare strictly increases between pairs of two consecutive agents, 
depending on whether k is odd or even. 

Note that for k = 2, agents n < 3 can never benefit from adding e s . Using Lemma 4, the 
following theorem characterizes the sequence {r„}“= 4 - I n particular, Theorem 4 says that the 
sequence {r„}“ =4 can be divided into even and odd sub-sequences, where both sub-sequences are 
decreasing and have the same limit as in (24) with k = 2. Thus, if adding noise improves 
the limit welfare, then it will benefit all but the first few finite number of agents. 

Theorem 4. For m > 2: 


January 25, 2017 


DRAFT 



23 


1) Both sub-sequences {r 2m } and {r 2m+ 1 } decrease and have the same limit r K „ and 

2) T2m + 1 ^ 2 " 2 ///. 

Proof. Here, we outline the sketch of the proof for r„, n e {4, 5, 6, 7} with a graphical illustration 
as in Fig. 8, which plots the welfares of these agents as a function of e. To show that part 1) is 
true, we essentially show r 4 > r 6 and r 5 > r 7 . These correspond to showing point A is higher 
than point C, and point B is higher than point D, respectively. To prove part 2), we essentially 
show that point C is higher than point D. The complete proof can be found in the archived 
version in [38]. 



Figure 8: An illustration of ordering for r„, n e {4,5,6, 7). 


□ 

Moreover, Theorem 4 also suggests an interesting fact: with the same signal quality p, e e X 2 , 
and m > 2, if adding noise benefits agent 2m it will also benefit agents 2m + 2,2 m + 4, . . .; and 

if adding noise benefits agent 2m + 1 then it will also benefit agents 2m and 2m + 3, 2m + 5, 

A consequence of this result is that while adding noise never benefits the first three agents, all 
other agents’ welfares, including agent 4’s, are better off as long as agent 5’s welfare is. 

VI. Adding noise versus using guinea pigs 

In this section, our goal is to study the trade-offs of two schemes (mentioned below) for 
welfare improvements. Assume that e e Ik and e > r*, 00 , thus increasing the observation error 
to e A * +1 is beneficial for the average asymptotic social welfare. We refer to this as Scheme 1. 
Extrapolating from Theorem 4, it stands to reason that in Scheme 1 there exists a smallest agent 
index 12 1 who benefits, and there also exists an agent index « 2 beyond which all subsequent 
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agents n > n 2 benefit. On the other hand, for agents n e (n\, n 2 ), some benefit and some do not. 
The second scheme we consider is motivated by [19] where the author restricts the first M + 1 
agents (called “guinea pigs”) to use only their private signals. The purpose of this scheme is to 
skew the initial state distribution toward a correct cascade, owing to the informativeness of the 
signal quality. Thus, the asymptotic social welfare is improved at the cost of the guinea pigs’ 
payoffs. We summarize these two schemes as follows: 

1) Scheme 1 : Increase e to the next threshold, e* k+l (keeping M = 1 fixed). 

2) Scheme 2: Sacrifice the first M + 1 agents as guinea pigs (keeping e fixed). 

In Sections VI-A and VI-B below, we characterize the expected welfare for each of these 
schemes by using (25) and modifying the corresponding probability of wrong cascade. 


A. Scheme 1: Adding obsen’ation errors 

As we increase e to e* +1 , note that (j^) = and ^ , for all n > 1. Thus, (25) 

becomes: 


E[n „\ | e=e . 


F + 


1 Ik 


- 1 


* 0 , 71 - 1 ' 


(30) 


B. Scheme 2: Sacrificing guinea pigs 

Using notation from [19], assume that there are M+ \ guinea pigs who have to use only private 
signals. 11 For these agents, we have E[n n ] = F, V/? = 1, 2, . . . , M + 1. The effect of having M + 1 
guinea pigs is equivalent to changing the distribution for the initial state of the Markov chain 
and then starting this chain with agent M + 2. For each i e {-M - 1, —M , . . . , M, M + 1], denote 
A; as the probability that the M + 1 guinea pigs result in h M + i = i for the subsequent agents 
n > M + 2. For agents n > M + 2, let u* n be the probability of wrong cascade of agent n, 
conditioned on h M +i = i- Their corresponding expected welfare can be written as: 



U: 


i,n - 1 ’ 


(31) 


n In [19], the first agent always follows his own signal and so M is the number of additional agents forced to follow their 
own signals. In our case, more than the first agent may always follow their signals, depending on the noise, but we still use 
M + 1 to denote the total number of agents that do this to be consistent with [19]. 
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where the initial distribution for h M +\ = i with M + 1 - i even is: 

(M + 1\ M+l+i M+l-i 

m + i -i)a 2 (1-fl) 2 , 


A; = 


(32) 


and A, = 0 for M + 1 - i odd. 

For -k < i < k, again using techniques from [1], the probability of a wrong cascade first 
occurring at time n > M + 2 starting from state i is given by: 


= < 


0, h - (k + i ) odd, 

k+i 

j (^) 2 (a(l-a))? B k ~ n , n - (k + i ) even, 

/ vn(k+ 

\~^IF 


(33) 


where B kn = £y=i cos" 1 sin sin and h = n - M - 1. Therefore, the probability of 


wrong cascade for an agent n is: 


1 , 

0 , 

0, 

r 


m=k+i 


i < -k, 
i > k, 
h < k + i, 
h> k + i. 


(34) 


m-(k+i ) even 

Thus, revisiting (31), the summation in the welfare of every agent n can be further divided 
into three components corresponding to three different ranges of i: i < -k, -k < i < k, and i > k. 
Whether all these three ranges exist depends on the values of M + 1 and k. It is obvious that 
for a fixed k, letting M + 1 increases increases the number of terms for which i > k. In other 
words, for high enough M, the initial state after the last guinea pig is more likely equal to the 
correct herding state, thus improving average social welfare. However, this happens at the cost 
of sacrificing the first M + 1 agents. In the next section, we highlight this trade-off and offer an 
alternative solution that combines the two schemes. We refer to this as: 

3) Scheme 3: sacrificing the first M + 1 agents as guinea pigs, and at the same time increasing 


6 to e, 


k+i- 


C. Comparisons and trade-offs 

1) Individual welfare: Figures 9-12 show numerical results for the three schemes mentioned 
above. 
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Each figure shows E[n n ] versus n for these schemes; the choice of p and e varies in different 
figures. In Figs. 9 and 10, the signal quality is high, and the initial noise is high (so that k = 8) 
and low ( k = 2), respectively. Figs. 11 and 12 show analogous curves with a low signal quality. 
In each figure, the curves labeled M = m for some integer m correspond to Scheme 2 with m + 1 
guinea pigs (where M = 1 is the baseline case since the first two agents always follow their own 
signals). Scheme 1 is shown in the curve labeled “adding noise" and Scheme 3 is shown in the 
curve labeled by “M = k and adding noise," where again k + 1 is the number of guinea pigs. 
Note that trade-offs exist for various values of p and e. For high M, Scheme 2 eventually gives 
better individual welfare as compared to Scheme 1. However, the larger the observation error e 
is, the more agents that need to be sacrificed. This trade-off is alleviated by using Scheme 3, 
which yields better welfares while sparing almost half of the would-be guinea pigs. 



Figure 9: Agent n welfare for p = 0.99, k - 8, e - 0.357 e (t^iq,^). 



Figure 10: Agent n welfare for p = 0.99, k = 2, e — 0.08 e (r*, 5, 63). 
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Figure 11: Agent n welfare for p = 0.51, k = 8, e — 0.437 e (t^io,^). 



Figure 12: Agent n welfare for p = 0.51, k — 2, e — 0.245 6 (r*, 5 , 63 ). 


2) Asymptotic welfare: By the law of large numbers, as M + 1 increases, the probability that 
agent M + 2 starts a correct cascade increases to 1. Thus, for large enough M, Scheme 2 achieves 
better asymptotic welfare as compared to Scheme 1. In this section, we aim to give a lower bound 
on the minimum number of guinea pigs, M + 1, that makes Scheme 2 asymptotically better than 
Scheme 1. 

Proposition 1. For e e Ik and e > T k oo Scheme 2 achieves higher asymptotic welfare than 
Scheme 1 only if M + 1 > k. 

Proof. We refer to the case where the social planner does nothing as Scheme 0 (i.e. neither uses 
guinea pigs nor increases e). Assume that M + 1 < k. For any e 6 I k , by Lemma 1 all agents 
77 = 1 , ..., k must use private signals even when agent n is not a guinea pig. By this argument, 
for a given sequence of private signals, Scheme 2 and Scheme 0 have identical samples path, 
thus yielding identical asymptotic welfare. Moreover, since e > Tk,oo, Scheme 1 is better than 
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Scheme 0. Thus, the number of guinea pigs, M + 1 needs to be strictly higher than k to achieve 
better asymptotic welfare. □ 

As a consequence of Proposition 1, as e — » 0.5, i.e., a — > 0.5, we have k — > oo. Thus Scheme 
2 cannot give better asymptotic welfare for a finite number of guinea pigs. For k = 2, 3, . . . , 10, 
Figs. 13 and 14 illustrate how the required minimum M (obtained numerically) increases as e 
approaches to 0.5 for high and low signal quality, respectively. In addition to plotting this for 
each given k, we also show a curve that connects the maximum optimal M for each k, which is 
clearly increasing in e. This shows that for high noise levels, Scheme 2 is only viable if one is 
willing to sacrifice a large number of initial users. 



Figure 13: p = 0.51, from left to right: k = 2,3 , ..., 10 



Figure 14: p = 0.99, from left to right: k = 2,3 , ..., 10 


VII. Cascades with no common database 

So far, our model has assumed that there is observation noise by letting the agents report 
their actions on a common database which all other agents can see. Even though the reporting 
process is in error with probability e, the observation errors are common knowledge. In other 
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words, what one agent observes reflects exactly what any of the previous agents observed. In 
some settings, this may be a wrong assumption, e.g., if agents may pull out information using 
different methods or through different channels. Additionally, such a common database may 
not be valid in other social networks settings different from online recommendation systems. 
In this section, we step away from this assumption and look at in some sense the opposite 
case. In particular, we consider the scenario when each agent individually observes an erroneous 
realization of the previous agents’ true actions, where all errors are assumed to be i.i.d. Again, 
the distribution of the errors is still common knowledge. In such a scenario, not only are the 
agents blind to other agents’ private signals, but they also do not know what their predecessors 
have observed. 

A. Will cascades persist? 

We remind the reader that the key to previous analysis of cascades with a common database is 
that every subsequent agent knows the exact answers to two questions: 1) Has a cascade started 
yet? and 2) If the answer to the previous question is Yes, then when did that cascade start? The 
reasons are two-fold: 1) The distributions of the prior knowledge about V, the private signal, 
and the observation errors are given; and 2) A common observation database is available to all 
agents. Without the second assumption, no agent can infer if and when herding has started. 

Again, since the pre-cascading actions are independent, Property 1 still holds as in the common 
database scenario. However, the following property shows that, without a common observation 
database, herding is not recurrent. 

Property 2' Without the common database, a cascade needs not be permanent. 

Proof Assume that the statement is not true. We will provide a counter-example to this using 
just four agents. Denote O'- as the observations of agent i about the action A,-. Again, assume that 
O’- is a noisy version of Ay, so that O'- differs from A ; in an i.i.d. manner with error probability 
€ for all agents i and their predecessors 1 < j < i - 1. Now the information set of each agent i 
includes his private signal S ; and his observations 0 \ , . . . , 0\_ x of the true actions A 1? . . . , A,., . 

The decision rule for agents 1 and 2 is to follow their private information, the same as before. 
Since the first two agents both use their own signals, agent 3 takes account of the independence 
of 0\, 0\. Thus, agent 3’s decision rule remains the same: herds only if 0\, O] are identical and 
e < e 3 *. Now assuming that e < e 3’ the Bayesian update of agent 4 is: 
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<y' v o\ a , ,a 2 0 \,oi 



Figure 15: Different realizations of observations history for agents 3 and 4. 


P[Of,<9f,Of|V = G]P[S 4 |V = G] 

y 4 = . (35) 

numerator + P [Of, O f, 0 4 \V = B]P[S 4 |F = B] 

Now, conditioning on V e jG.fi), we can write: 

P [Of, Of, O f] = P[Of|Of, Of]P[Of, Of] 

= (1 - 6)P[A 3 = Y, Of, Of] + eP[A 3 = N, Of, Of], (36) 


where (36) holds if Of = Y. For Of = N, e and 1-6 are interchanged. The calculation of y 4 is 
possible if we can evaluate P[A 3 ,Of,Of] for both A 3 = Y,N and V = G,B. This can be done by 
conditioning on the 4 possibles realizations of agent 3’s observations as illustrated in Fig. 15. 

As an example, consider the case the information set of agent 4 is {5 4 , Of, Of, Of} = {0, Y, Y, N}. 

The above procedure gives: 

_ [e<?G + (1 - e)Pc + 2(1 - a)8 G ] (1 - p ) ^7) 

numerator + [ea B + (1 - e)p B + 2a6 n \ p' 

where subscripts correspond to true value V e ]G, B}, and: a G = [a - 6(1 - 6)] 2 , a B - [(1 - a) - 6(1 - 6)] 2 , 
Pg = Pb = [e(l - 5 g = 6(1 - 6) [a - 6(1 - 6)], 5 b = 6(1 - 6) [(1 - a) - 6(1 - 6)]. Based on 
the decision rule in (2), one would need to compare y 4 and 1/2. Using the assumption that 
0 < 6 < 6f, which means ^ , we deduce y 4 < 1/2, thus agent 4 follows his own 

signal and chooses N. Applying similar procedure for the remaining realizations of agent 4’s 
observation history, we have the decision rule shown in Table I. 

Thus agent 4 herds only when all of his three observations are identical. Now given that agent 
3 cascades to the first two agents’ action, agent 4 would choose the same action only if there 
were no observation errors, i.e., with probability (1 - 6) 3 , which is bounded away from 1. Thus 
in this scenario the cascade does not persist with probability one. □ 
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{S 4 ,Oj,C&0j} 

a 4 


^4 

1 YYY 

Y 

1 YYN 

Y 

OYYY 

Y 

0 YYN 

N 

1 YNY 

Y 

IYNN 

Y 

OYNY 

N 

OYNN 

N 

INYY 

Y 

INYN 

Y 

ONYY 

N 

ONYN 

N 

1NNY 

Y 

INNN 

N 

ONNY 

N 

ONNN 

N 


Table I: Decision rule of agent 4. 


B. Complexity analysis 


We now discuss the complexity of updating the posterior probability for an arbitrary agent n 
without a common database. Assume that e e I k as before. We have the following: 

1) For agents n = 1, . . . ,k, |/i„_i| < k thus cascades cannot happen. The decision rule is simple: 
always follow one’s signal. Thus complexity is (9(1). 

2) However, any agent n > k faces the possibility that some prior agent was in a cascade. While 
agent n’s decision rule remains the same as in (2), the complexity of his Bayesian update of 
the posterior probability, y n , increases exponentially as n grows. In fact, conditioned on his own 
observations sequence, every agent n needs to account for 2"“ 2 possible observations sequences 
of agent n — 1. For each one of those 2" -2 possibilities, agent n needs to know the decision rule 
of agent n— 1, which depends on another 2" -3 possible realizations of agent /7-2’s history. Since 
the pre-herding agent n = k ' s update is of order Oik) (including first calculating the sufficient 
statistics h n -\ and then comparing that to +k ), the complexity of the update for agent n > k is: 

f n - 1 \ 



O(k) = <9(k2" 2 -* 2 ). 


(38) 


\m=k / 

which blows up exponentially as ( n - k ) increases. 

For comparison, we also evaluate the complexity when a common database is available. For 
agents n = 1 the complexity is also (9(1) by simply checking if n < k. For any agent 
n > k, the decision process is done by simply computing the sufficient history statistic h n -\ and 
comparing that with ±k at each step. Note that |/z„_i| never exceeds k throughout the updating 
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process, and so in the worst case agent n endures a complexity of 0(n). This illustrates a benefit 
of having a common online platform for aggregating information. On one hand, such a platform 
lets agents get more information than they probably could by just having their own observations. 
On the other hand, it also provides a computational advantage such that if there is noise present, 
having this common platform makes it easier for agents to learn. The complexity incurred without 
a common database for Bayes-rational agents suggests that in such case, it is worth considering 
non-Bayesian approaches in the same spirit as [26]. We leave such approaches for future work. 

VIII. Other Generalizations 

In this section, we discuss several other generalizations of our model. We start next with 
examining different tie-breaking rules. 

A. Comparison of different tie-breaking rules 

When an agent’s postierior probability satisfies y„ = 1/2, its expected pay-off under either 
action is the same. In [9], the authors assumed that when such a tie occurs, agents randomize 
their choices. Another option for breaking a tie is to follow the previous action whenever a tie 
happens. In our model, we instead assume that an agent chooses to follow his own private signal 
when he is indifferent. Our assumption not only simplifies the analysis, but as we state in the 
next proposition, also is the better tie-breaking rule as compared to the other two: by following 
their own signals, agents who face a tie have lower wrong cascade probabilities and therefore, 
higher welfare. 

Proposition 2. For a given integer k > 2: 

1) When e 6 (e/,e/ +1 ), all agents are never in a tie. Thus all three tie-breaking rules perform 
equivalently. 

2) When e = e* k , agents who face a tie achieve the lowest (resp. highest) wrong cascade 
probability if they follow their own signals (resp. follow the previous action). 

Proof. We start by reminding the reader that a tie happens when the posterior probability satisfies 
y n = 1/2, i.e., there exists an integer k > 2 such that To prove the first part, note 

that when e e (e/,e/ +1 ), the above equality cannot happen. Thus all tie -breaking rules give the 
same outcome. 
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To prove the second part, assume that e = e* k for some k > 2. If an agent n encounters a 
history with \h n -\\ - k - 1 that is opposite to his private signal, he has to resort to the tie- 
breaking condition. If he follows the previous action, a cascade happens where the absorbing 
states of the underlying Markov chain are ±(k- 1). On the other hand, if he follows his private 
signal, a cascade does not happen yet, as the absorbing states of the underlying Markov chain 
are ±k. Using the equations (12)-(8), the latter cases give a lower wrong cascade probability and 
higher correct cascade probability. As a consequence by (18), when agent n follows his own 
signal, he has a higher welfare. Moreover, this also yields a higher asymptotic welfare as seen 
from (19). This completes the proof. □ 

The intuition behind this result is as follows: when the tie-breaking rule is to follow an agent’s 
private signal, the underlying Markov chain has the two absorbing states expanded by one unit on 
both sides as compared to when following the previous action. This induces a longer expected 
time until a cascade event (on either side). However, in both scenarios the chains have the 
same drift, which is always in favor of the correct cascade. This effect decreases (increases) the 
wrong (correct) cascade probabilities. This can also be seen in Fig. 4: each point of discontinuity 
happens exactly when k is increased by 1 unit where the probability of wrong cascades drops. 
If agents follow the previous actions, this yields the upper end of the discontinuity; while if 
agents follow their private signals, this yields the lower end of the discontinuity. Moreover, any 
randomized tie-breaking rule results in a linear combination of these two end points, which also 
yields a higher wrong cascade probability than when agents follow their own signals. 

B. Cascades with bounded rationality 

Next, we consider the possibility that agents might make mistakes in choosing their optimal 
action. This captures a more realistic assumption since it is well documented that real-world 
agents are not always rational. In fact, the reasons underlying human irrationality have been 
well argued in [20] where the authors presented the numerous cognitive deficiencies that could 
account for human systematic deviation from the perceived normative behaviors. Further, from 
the viewpoint of Prospect Theory ([5]), the action error can also be argued to be a consequence 
of inconsistency in preferences when agents tend to act differently under the possibilities of gains 
(facing a correct herding) and losses (being in a wrong herd). Finally, such a deviation from 
full rationality has also been used for equilibrium selection, in particular, for trembling-hand 
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perfection equilibrium ([4]). 

Indeed, there is a long history of using noise to model imperfect rationality (see e.g. [8], [14]) 
resulting in agents having so-called “noisy best responses.” Here, we follow the approach in 
those works, and examine a simple form of bounded rationality. In particular, assume that all 
agents derive the exact posterior probability of the true value of the item, but then choose the 
sub-optimal action with probability e a e (0,0.5); we call this an “action error.” Assume that this 
probability is known to all other agents, and is the same across all agents. Again, as before, 
assume that the observation database is still common knowledge, with i.i.d. observation errors 
with probability e 6 (0, 0.5). 

The analysis is similar as in the setting with no action errors, so we omit the details. The reason 
being we can combine the two types of errors, e and e a , to an equivalent error type, f(e, e a ), and 
use that in updating the posterior probability as before. As a result, the probabilities of wrong 
and correct cascades, u* 0n and v* 0n , respectively, are as those in a model with only observation 
errors of rate f(e, e a ). However, the agent welfares are now reduced due to the probability of 
choosing the non-optimal action, e a . In particular, for agents 1 < n < k, they all have the same 
welfare given by: 

E [n n \ = [P [A n = Y\V = G] - P [A n = Y\V = B ]} /4 
= (1 - 2e a )(2 p - l)/4 = (2a ] - l)/4 = F { . (39) 


where a x = f(e a ,p). 

For agents n > k + 1 : 


E [n n \ = [P [A„ = Y\V = G] - P [A„ = Y\V = B]} /4 


F\ + [(1 - ~ Ca)v o,„_i - ( fll - ^)Mo,n-l] / 2 


= F l +( 1 - 2e a ) 


1 ~ P * 
2 



(40) 


By comparing (39) and (40) against (17) and (18), it is clear that the action error e a only 
reduces the welfare of every agent by a factor of 1 - 2e a as compared to a model with only 
observation errors with probability /(e, e a ). Thus, the expected welfare of agents in a quasi- 
rational model inherits many properties from the fully rational setting with noisy observations 
as in Theorem 2 (e.g., the overall welfare is not monotonic in the error rate). 
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C. General prior for the true value, V 

In this section, we consider the scenario where the binary true value V of the item might not 
be equally likely, but the ex-ante pay-off is still restricted to be zero. 12 In particular, assume that 
P[V = G] = A G (0, 1). Let the rewards (resp. losses) of buying when V = G (resp. V = B) be x 
(resp. -y) where x,y > 0 and * + y > 0. In all previous sections, we have assumed .v = l,y = 0 
and A = 1/2. In this general scenario, setting the ex-ante payoff to 0 leads to the new cost of 
the item being C = Ax - (1 - A)y. Putting these all together, the general payoff is given by: 

0, if A n = N, 

Ttn = < X - C = (1 - A) (x + y) if A n = Y,V = G, and (41) 

-y - C - -A(x + y) if A„ = Y,V = B. 

The ex-post expected payoff of agent n when taking an action A„ e {T,A^} is calculated as: 


E [n n \A„ = N] = 0, and 

< 

E [n„\A„ = Y] = (x + y)(y n - T). 


(42) 


By (42), the new decision rule is based on comparing the posterior probability, y n , to A. However, 
as A changes, the posterior probability is a function of A and is given by: 

Yn = nv = G\S n ,HA = t n \ YJ (43) 

Thus, the new (user) optimal decision rule compares fij-n- i and 1. Moreover, since yS„,4-i are 
conditioned on V and, thus, do not change as A changes, the new decision rule with respect to 
fi n £ n - 1 is the same as before. Conditioned on V , the transition probabilities of the underlying 
Markov chain are identical to the case A = 1/2. As a result, conditioned on V, the probabilities 
of cascades for an agent n are also given by: 


jp [wrong\V = G] = u* 0n = 1 - v* „, and 
}p [wrong\V = B] = v* 0n = 1 - u* 0n . 

However, the (unconditional) probabilities of wrong and correct cascades are given as Au* 0it + 
(1 - d)Vo ;; and Av* 0n + (1 - A)u* 0n , respectively. Given that A, x,y are fixed, the welfare for agents 


12 The case of zero ex-ante payoff is the interesting case as at the outset agents have no preference for either action, and thus 
the additional information from the signals and past observations could be helpful. 
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are scaled by a constant factor as compared to when V is equally likely G or B. We show this 
next. All agents 1 < n < k, who use their own signals, have the same payoff: 

E [n„\ = (1 - A)E \n„\V = B] + AE [n„\V = G] 

= (l-A)(-A)(x+y))P[A n = Y\V = B] 

+ 4(l-4)(* + y)P[A„ = Y\V = G] 

= A(l-A)(x + y)(2p-l) = F 2 . (45) 

For agents n > k + 1, who face the possibility of cascades: 

E [n„\ = A( 1 - A)(x + y) {P [A n = Y\V = G] - P [A n = Y\V = 5]} 

= 4(1 - A) (x + y) [vq fJ _ 1 + p( 1 - - <„_,)] 

- 4(1 - A){x + y) + (1 - p)( 1 - - v^)] 

= F 2 + 24(1 - 4)(x + y) [v5^_! - . (46) 

Thus, comparing (45) and (46) to (17) and (18), the welfare of all agents are scaled by a factor 
of 44(1 - A){x + v). If x + y > 0 is fixed while 4 can change in (0, 1), the welfare are maximized 
if the true value V is equally likely G or B, i.e. 4 = 1/2. 

IX. Conclusions and future work 

This paper studied the effect of observation error in a simple Bayesian information cascade 
where agents can be either rational or quasi-rational. We showed that despite the presence of 
observation errors, cascades happen in finite time with probability 1. However, observation errors 
can delay the onset of a cascade; we also determined the error thresholds that increase this delay. 
In addition, cascades with observation errors have the same level of fragility as when such errors 
are not present, and in both cases they can be broken by an additional private signal. Using a 
Markov chain based analysis we determined the probabilities of cascades for an arbitrary agent 
and used these to calculate the agents’ welfare based on the given signal quality and the error. 
Our main result shows that for certain ranges of parameters, adding a controlled amount of 
observation error can lead to higher welfare for all but a finite number of agents. In such 
scenarios, we compared and contrasted the trade-offs of different methods for improving the 
agents’ welfares. These results may be helpful for a platform operator who has the option to 
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weigh the trade-offs and choose the best method that would benefit his platform. Our results are 
strongly based on the existence a common database, which stores all the erroneous observations 
of past actions. Moreover, we argued that such a database provides an important computational 
benefit to agents in performing Bayesian updates. In future work we plan on generalizing to 
agents with heterogeneous private signals and observation errors, and allowing each agent to 
only observe subsets of past agents’ actions prior to taking actions. 
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Appendix A 
Proof of Property 1 

We show the following equivalent claim: if a cascade has not happened yet, then opposite 
observations cancel out. Consider an arbitrary, non-cascading agent n. Since any agent i 6 
{1,2, ...,n - 1} uses his own private signal for the decision, we have: P[(9, = Y\V = 1] = P[(9, = 
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N\V = 0] = a, P[<9, = A|V = 1] = V[Oj = Y\V = 0] = 1 - a. Since O u 0 2 , ..., O n ^ are mutually 
independent, and independent of S n , the posterior probability of agent n is: 

y„{S n , 0\, ..., 0„-i) = P[V = G\S n , 0 \, ..., O n -i] 

nno i \v = Gmsn\v = G] 

_ i=l 

n - 1 

numerator + n P[(9,|V = £]P[S„1V = B] 

(=i 

where all the opposite observations cancel out in both numerator and denominator. Similar 
reasoning holds for the case V - B. □ 


Appendix B 
Proof of Property 2 

The key is that since the observation database is available to all agents, every agent knows 
whether and, if yes, when a cascade happens. Thus, the observations from the onset of a cascade 
provide no information to subsequent agents, and are ignored. Moreover, since all private signals 
and noisy observations are i.i.d., subsequent agents have identical posterior as the agent who 
started the cascade. Now since all agents have the same preferences, they also have the same 
optimal action of cascading. Thus the cascade lasts forever. □ 


Appendix C 
Proof of Lemma 1 


It is sufficient to derive e* k from the scenario n = k and { S „ , 7T„ } = {0, Y, Y , ..., T} and we will 
do this by induction. For agent n = 2 with the information set {0, Y\, Lemma 1 claims that agent 
2 does not herd as long as e > 0. Indeed: 


72(0, Y) = 


(1 - p)a 


< 1/2 Ve > 0. 


(47) 


(1 - p)a + p{ 1 - a) 

By the decision rule in (2), agent 2 chooses N and does not start a cascade. For agent n = k > 2 
with information set {0, F, F, ..., F), assume that e > e* k _ v i.e., all previous k- 1 agents do not 
cascade. Similarly, using Bayes’ rule gives: 

(1 -p)a k ~ l 


y„(0, F, ..., Y) = 


(48) 


(1 - p)a k ~ x + p( 1 - a) k ~ l 
Agent n does not cascade if y„(0, T, ..., Y) < 1/2, i.e., (1 - p)a k ~ x < p( 1 -a) k ~ l . Now, since 


1 -a _ (l-g)(l -p)+ep _ (1 -£)<?+<= 
a — (l-e)p+e(l-p) ~ (l-e)+eo’ 


, we arrive at the form of (5) as in Lemma 1. 


□ 
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Appendix D 
Proof of Lemma 2 

The proof follows using techniques from [1], Let T_ ki and T ki be random variables denoting 
the first time the Markov chain hits the absorbing states —k and k, respectively, starting from state 
i. Thus, u iM = P[T-k,i = n] , v^ n = P [T ki = n\. Let Ui(t),Vi(t) be the corresponding probability 
generating functions, where (eR. We have: U;(t) = E[t T u ] = £“ =0 u^ n t n , V/(0 = E\t T,J \ = 
Z,T=o v Unt n - With one-step transition probabilities being a and 1 - a, we have the the difference 
equations: 

Ui(t) = atU M (t) + (1 - a)tU^{t), (49) 

Vi(t) = (1 - a)tV M (t) + atVi-xit), (50) 

where -k < i < k, with the boundary conditions: U- k (t) = 1, U k (l) = 0, V~k(t) = 0, V k (t) = 1. The 
solutions are: 

Ui(f) = [K k (t)A 2 2 k (t) - A] k (t)A l + k (t)\ / [Af(t) - Af(t )\ , (51) 


Vi(t) = [. A[ +k (t ) - Af(t)\ / [A] k (t) - Af(t )\ , 


(52) 


where A l 2 (t) = [l ± -^1 - 4a(l - a)t 2 j /(2 at). Since the Markov chain starts at state i = 0, we 
find w 0 ,n and v 0 ,„ by: 


d n U 0 (t) r d»V 0 (Q 

nl(dt) n '~ 0, n\{dt) n 


(53) 


which can be written in closed-form as in (10) and (11). 


□ 


Appendix E 

Coupling method for the proof of Theorem 1 

Here we provide more details on the coupling method used to show that the inequalities of 
cascade probabilities are strict in the proof of Theorem 1. Using Proposition 1.10.4 in [6], there 
exist two random variables Z x n _ x and Z 2 _ ] on a common probability space such that P (Z' l _ l = 
k) = P(Zj_j = k), P(Z”_j - k) - P(Z^_ ! = k) and the stochastic order also holds for Z[ , and Z 2 _ { 
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almost surely, i.e., Zj , > Z 2 _, a.s. Define two independent random variables A* and B„ that are 
also independent of Z|_, and Z 2 _ x by: 


1, with probability a , 

-1, with probability l -a. 

|0, with probability g -r, 

// 

-2, with probability 1 - q - r . 

I a 

Next, define the random variable A 2 by: 

A 2 n = A l n + l A i =[ B n . 


B„ 


(54) 


(55) 


(56) 


By (54)-(56), A\ > A 2 and A 2 n has the following distribution: 

1, with probability a , 

1 , with probability 1 - a ' . 
Using the above, define Z J n for j = 1,2 as: 


7 J 


Z 2 1 , 

n - 1 ’ 


if |Z 


n- 1 1 


k. 


(57) 


(58) 


\Z J . + A 1 , , otherwise. 

n— 1 n— 1 ’ 

Thus, Z,' ( and 7?- have the same distribution as Z n and Z”, respectively. Now, Pr(Z 2 = k) can be 
written as: 


P(Z„ 2 _! =k) + P (Z 2 _, =k-l,Al = 1). (59) 

Since Z i n _ ] > Z 2 _ p Z 2 _, = k implies Z\_ x - k. Moreover, Z 2 _, = k - 1 also implies Zj , = k - 1 
or Z^_j = k. Thus (59) can be decomposed as: 

P(t-i = k,Z\_ x =k)+ P(Z 2 _j = k - ft, = u A 2 = 1) 

+ P(Z 2 _, = k - ft, = fc - 1, A 2 = 1). (60) 

Since A 2 = 1 implies A l n = 1, (60) is smaller than: 

P(Z 2 _, = k,Zli =k) + P(Z 2 _ 1 = k - 1 ,Zt = *,Aj = 1) 

+ P(Z 2 _, = if - l,Zt = k - 1 ,Aj = 1). (61) 
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Now, note that: 


= k — 

l,Zli=k-l,A l n 

= i)< 

mi, = k- 


p(zh 

= k,Z ] n _ t = k) 




+ P(Z; 

l l= k-l,2i.i = 

k,A\ = 

i) < p(z,u 

= k). 

P(Z 2 = 

k) < 




ml, 

= k)+nzi, = k 

-UAl 

= i) = p(z ; ; 

= k). 


These imply P(Z" = k) < P(Z,j = k), i.e., v*" < v*' n . 


□ 


Appendix F 
Proof of Lemma 4 


First, equations (26) and (28) can be easily seen by using parts 1) and 2) of Lemma 3 on 
the welfare formula in (18). Second, equations (27) and (29) can be shown by simplifying the 
expression of u* 0n _ { for both e e I 2 and e e J 3 , respectively, then substituting into equation (25). 
1) Applying Lemma 2 for k = 2 and m > 1, we get: 


W0,2m+1 - 0 

< 

m 0 , 2 m = \2 2m a m -\i - a) m+l cos 2 '"- 1 (f ) sin (f) , 
which can also be viewed as a geometric series: 


(62) 


r 


«o , 2 = (1 - a) 2 


\^0,2m+2/ ^0,2m 2$(1 U) 


(63) 


This means m 0 , 2m = (1 - a) 2 [2a(l - a)]" 1 l , and: 


2 1 - [2a(l - a)T 
1 - 2m(1 - a) 

j even 

Now, applying (64) to (25), we obtain (27). 

2) Similar analysis applies for k = 3 and m > 2. 


2m 


U, 


0,2m+l 


i 0,2m 


^ u 0 j = (1 - a) 


(64) 


□ 
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Refer to Fig. 8. To find r„ for each agent n, we need to solve, for t„ e J 2 , the following 
equation: £[/r„]| e=rn = E[n n ] | e=e .. 

Clearly, we need to consider two cases: n = 2m and n = 2m + 1. Using the results from (27) 
and (29), we solve the following equations: 
a) For n = 2 m: solve E[n 2m \\ £=T2m = E\n 2m \\ e=e *, i.e., 


nra-1 


r „ , 1 - [2a(l - a)] 

[a(1 ~ a) - £l 1 -2a(l - a) 

b) For n = 2m + 1: solve E [n 2m+x \\ £=Tlm = E[n 2m+ i]| e=< , i.e., 


1 - N) 

i - 3e: 


m— 1 


(65) 


, ,i-[2ad-o)r 

[a(\ - a) - 6]— — — — = (e 3 ) 


m— 1 


1 - 3e( 


(66) 


1 - 2r/(l - a) 

The detailed proof is continued as follows: 
i) Showing that {r 2m } is decreasing: 

Consider r 2m and r 2m+2 . Using (65), to show point A is higher than point C, we need to show: 


/ \m 

1 - [ 2 fl 2w (l - aim)]"' < 1 - ( 3 e 3 j 
1 ~ [2a 2m (l - a 2m )] m_1 < \-(3e;) m ~ r 

where a 2m = f(p, r 2m ). 

Let g(.r) = -p^rr, for 0 < x < 1. Then g(x) is increasing. Now note that 63 = a* 3 ( 1 - a* 3 ) > 
a 2m ( 1 - a 2m ) ^ 3e* > 2a 2 „,(l - a 2m ), thus (67) is proved, i.e., r 2m > r 2m+2 . 
ii) Showing that {r 2 m+ i} is decreasing: 

Consider r 2m+ i and r 2m+3 . Using ( 66 ), to show point B is higher than point D, we need to 
show: 


/ \m 

l-[2a 2m+3 (l-q 2w+3 )] w+1 < 1 ~ ( 3 G*j 
1 - [2a 2m+3 (l - a 2m+3 )]" 1 1 _ ^3 6 *y"- 


where a 2m+3 = f(p, r 2m+3 ). Indeed, we have: 


1 


( 3e 3*) 


m + 1 




1 -Nr 


(68) 


(69) 
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